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Strongly coupled models at the TeV scale often predict one or more neutral spin-one reso- 
nances (Z') which have appreciable branching fractions to electroweak bosons, namely the 
Higgs and longitudinal W and Z. These resonances are usually believed to have multi-TeV 
mass due to electroweak precision constraints, placing them on the edge of LHC discovery 
reach. Searching for them is made particularly challenging because hadronically decaying 
electroweak bosons produced at such high energy will appear very similar to QCD jets. In 
this work we revisit the possibility of discovering these resonances at the LHC, taking advan- 
tage of recently developed jet substructure techniques. We make a systematic investigation 
of substructure performance for the identification of highly Lorentz-boosted electroweak 
bosons, which should also be applicable to more general new physics searches. We then esti- 
mate the model-independent Z' discovery reach for the most promising final-state channels, 
and find significant improvements compared to previous analyses. For modes involving the 
Higgs, we focus on a light Higgs decaying to bb. We further highlight several other novelties 
of these searches. In the case that vertex-based 6-tagging becomes inefficient at high p?, we 
explore the utility of a muon-based 6-tag, or no 6-tag at all. We also introduce the mode 
Z' — > Zh — > (i/p)(bb) as a competitive discovery channel. 



I. INTRODUCTION 



In the next few years, expectations are high that the LHC will discover the Higgs boson, 
finally verifying all of the basic building blocks of the Standard Model. There is also much 
anticipation that this discovery will not signal the end of novel physics near the electroweak 
scale. However, given the lack of clear anomalies in existing flavor and electroweak precision 
measurements, a very real possibility is that the Standard Model with a light elementary 
Higgs will turn out to be a reasonably good description of LHC phenomenology at energies 
of less than about a TeV. Should this prove to be the case, after a few 10's of fb -1 of data 
attention will become highly focused on carefully testing the properties of the Higgs through 
a combination of coupling measurements lit] and investigations of W, Z, and h production 
at progressively higher energies |2|, 13[. One of the most important goals is to determine to 
what extent the Higgs is truly an elementary particle, versus a light state arising from a new 
composite sector. 

Like most scenarios for physics beyond the Standard Model, specific composite Higgs 
models are under a certain amount of tension between, on the one hand, satisfying the 
flavor and electroweak precision constraints and, on the other hand, reintroducing the fine- 
tuning which the models were invented to solve, though still at a much reduced level (see 
for reviews e.g. . The usual compromise is to choose a compositeness scale of several 

TeV. If this kind of scenario proves to be true, then one of the main hints that there is 
something unusual about the Higgs will be the production of multi-TeV resonances with 
large couplings to Higgs bosons. 

In this paper, we aim to make a detailed investigation of the LHC's sensitivity to such 
scenarios via the production of a neutral spin-1 resonance in qq annihilation. The existence 
of one or more Z' particles with these quantum numbers is rather ubiquitous in composite 
Higgs models, for example serving as the analogs of the p mesons of QCD or as higher 
states of an enlarged broken gauge sector. Owing to the large coupling with the Higgs, 
as well as with the (longitudinal) W and Z bosons, the decays of these Z' resonances will 
typically be dominated by the modes Zh and W + W~ . This is in stark contrast to more 
standard U(l)' models, which usually predict much larger relative rates into leptons and 
quarks. Observation of a Z 1 with large branching fraction to either of these electroweak 
boson modes would therefore serve as highly suggestive evidence for a composite Higgs 
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sector, whereas lack of an observation would place important constraints. 

The two intermediate states Zh and W + W~ can lead to a variety of final-state configura- 
tions. Despite the additional structure compared to, say, a simple dileptonic final state, these 
can be some of the most challenging search channels. Indeed, the flavor and electroweak 
data hint at the most difficult possible situation: the Higgs may be light and the Z' may be 
heavy. The former presents a difficulty since the Higgs decays then become dominated by 
bb, versus the more distinctive IW'*' jZZ^ modes of heavier Higgses. The latter suggests 
that Z' production rates will be small (e.g., 0(1 fb) for = 3 TeV 6|, |7|), and also that 
the secondary decay products of the highly energetic electroweak bosons will be Lorentz- 
boosted into tight, jet-like configurations in the detector. For leptonic decays of the W and 
Z bosons, this last observation does not necessarily pose a problem, beyond the usual loss of 
efficiency due to leptonic branching fraction. But for any boson decays involving hadrons, 
the distinction with a QCD-induced jet can become uncomfortably blurred. 

In the past several years, various techniques have been developed for dealing with highly 
boosted Ws, Zs, Higgses, and tops (e.g., js], B-l^]. For our present study with boosted 
electroweak bosons, we will utilize the procedure of js|. Originally, this was developed to 
aid in the identification of semi-boosted light Higgses in the high-p^ tail of Standard Model 
Higgsstrahlung production. The procedure adapts on an event-by-event basis to pick out 
hard substructures within a region of interest in the detector, while actively attempting to 
remove uncorrelated radiation which can degrade mass resolution. We will find that the 
ideas of 8j easily extrapolate into the truly high-boost regime for the Higgs, as well as to 
hadronic decays of boosted Ws and Zs. Along the way, we will find that the scale-invariant 
nature of the procedure offers a number of nontrivial advantages. 

Beyond the simple bb decays of light Higgses, as well as qq decays of electroweak gauge 
bosons, substructure-type techniques can also be applied to the other possible decay modes 
of a boosted Higgs, such as r + r~ and WW^ / Z Z^*\ Including these modes in a Z' — > 
Zh search offers several advantages. First, a discovery in multiple modes would bolster 
confidence that we are seeing a real signal. Second, broadening the set of Higgs decay 
modes also broadens the sensitivity over Higgs masses, and allows for increased sensitivity 
in cases with nonstandard Higgs couplings (e.g., gaugephobic (21 23] or fermiophobic 24]). 
Third, a measurement of Higgs branching fraction ratios gives us further clues regarding its 
identity (especially whether it is the same "Higgs" as the one discovered at lower energies) 
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and, in the Standard Model limit, can give us a secondary handle on the mass. In this paper, 
we focus on the specific case of a 120 GeV Higgs decaying to bb. We separately explore the 



decays h — > t + t~ and h — > WW^/ZZ^ in two companion papers 25|, |26|. (See also [2 
for an analysis of Z' — > Zh —> Z{ZZ) in the all-leptonic channel.) 

Given this restriction, we find that the most promising Z' discovery channels are 

• WW {lv){qq') 

• Zh^ (l + l~){bb) 

• Zh^ {vv){bb) 



The first two of these have been explored before in [6|, |28|, |29| . (The first channel was also 



3l|, 



studied in (3), [3^] in the context of weak boson fusion. We also point out a study in 
closely related to our second channel, of h — > ZZ — > {l + l~){qq) at rrih = 300-600 GeV 
using substructure methods.) By applying jet substructure techniques, we obtain significant 
improvements, for example achieving sensitivity to fb-scale cross sections for =3 TeV in 
W + W~ with only 100 fb -1 of data from a 14 TeV LHC. The last mode, utilizing the invisible 
Z decay in Zh, appears to have been largely underappreciated for resonance searches, but 
we find that it is quite competitive with the leptonic Z. We further explore the utility 
of 6-tagging the Higgs-jet in the two Zh modes, using a muon-based technique that should 
extrapolate robustly to high-p^. Application of such a tag appears to be beneficial for finding 
Z' bosons in roughly the 1-2 TeV mass range for the leptonic Z mode, and up to about 3 
TeV for the invisible Z. Beyond these masses, the background becomes so small that an 
untagged analysis is more sensitive. 

The paper is organized as follows. In section [TT] we take a more detailed look at the 
theoretical motivation for our study and classify the relevant Z' final states. In section IIHI 
we review jet substructure techniques, characterize their performance for tagging boosted 
electroweak bosons, and make a comparison with a more traditional jet-based approach. We 
present our model-independent discovery reach analyses in section IIVI Section [V] contains 
conclusions. We include discussions of pileup and detector modeling in appendices lAl and IB1 
respectively. 
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II. Z' THEORY AND FINAL-STATE CHANNELS 



A. Theoretical motivation 



For our purposes, the relevant couplings of a lone generic Z' can be parametrized as 1 

dnt = z',r with r = j2M^i+g' H (H^H+h.c.). (i) 

i 

Here, ipi stands for the chiral Standard Model (SM) fermions (in complete SU(2)l multiplets) 
and H for the Higgs doublet (neutral Higgs will be denoted by h). g[ and g' H are the respective 
coupling constants. The coupling gg controls the decay rate of the Z' into Zh and W + W~ 
by virtue of the Goldstone boson equivalence theorem. (We exclusively work in the limit 
Vfiz 1 3> mz-i Triw) Immediate consequences of this are that the rates into Zh and W + W~ are 
nearly equal, and the Zs and Ws produced in the decays are highly longitudinally polarized. 
More general models (as we will see below) can have more than one Z' which can mix 
with each other and with the Z, leading to 5 , [/(2)^-nonuniversalities in the couplings of the 
individual Z' mass eigenstates. However, the approximate equality of total Zh and W + W~ 
rates is usually maintained, as electroweak symmetry breaking (EWSB) is necessarily a small 
effect in the heavy Z' sector. 

Roughly speaking, Z' models can be categorized into two classes. In the first class, the Z' 
is realized as a gauge boson of a new gauge group broken near the TeV scale. (See 32] for a 
review.) In the second class, the Z' emerges from a new strongly-coupled sector which also 
generates the Higgs boson or directly breaks electroweak symmetry. One simple example 
of the former class is an admixture of U(l)y and U(1)b-l- Another typical example is an 



admixture of the U(l) x and U(l) 



various models of Technico 



and warped 5D models 48 
two classes. 



or 



33 



subgroups of E 6 . 2 The class of composite Z' includes 



41], composite Hig gs 421 446]. partial compositeness [47] . 



5lJ. The Little Higgs [4j, |52[ effectively lives in between these 



1 We neglect four-point couplings with the SM Higgs, as well as terms induced by replacing — s- £> M . 
(These are both important for a more complete, gauge- invariant treatment of Z 1 couplings to the SM gauge 
bosons.) The possibility of kinetic mixing with hypercharge is already incorporated in our paramctrization. 
The field redefinitions necessary to undo such a mixing will simply shift — > j"* + njy, where k is a 
(usually small) number proportional to the coefficient of the mixing term. 

2 The first example is automatically anomaly free without extra exotic fermions other than right-handed 
neutrinos. The second example requires new exotic fermions near the TeV scale. 
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The simple gauge Z' couples fairly democratically between fermions and Higgs, usually 
leading to much higher branching fractions into fermions simply due to multiplicity (as well 
as a factor of two enhancement for decays into fermions versus decays into scalars). In fact, 
when the lepton Yukawa couplings have SM-like minimal structure, and respect the U(l)' 
symmetry, we can make a sharper statement: 

BR(Z' Z+r) > BR(Z' -> Zh or W + W~), (2) 

where I is any individual species of lepton. If this relation is violated, we can infer that 
either the lepton Yukawa sector has non-minimal structure (such as multiple Higgs doublets 
or [/(l)'-breaking spurions) or the couplings of the Z' do not furnish a gauge symmetry. 
Note that for a hypercharge-sequential Z', the branching fraction to each species of lepton 
is 12%, whereas the branching fraction to Zh or W + W~ is 1.2%. For a B — L boson, 
the branching fraction to each lepton is 15%, and into electroweak bosons it is zero. It is 
important to realize that even if a resonance is first discovered in the modes (which 
are very straightforward experimentally), concurrent searches in Zh and W + W~ can yield 
very valuable information about the nature of the resonance, even in the case that nothing 
is found. 

The situation drastically changes if the both the Z' and the Higgs doublet are composite 
particles generated by a new strongly-coupled sector. In this case, the coupling to the Higgs 
doublet can be hierarchically larger than the couplings to the fermions, and the decays of the 
Z' become dominated by the modes Zh and W + W~ (and perhaps ti, as we will see). 3 On the 
other hand, the typically small couplings between the composite sector and the fundamental 
SM fermions can significantly reduce the production cross section via qq annihilation. Indeed, 
weak boson fusion is often taken as the most robust wav to produce composite resonances, 
but a very high price in effective luminosity (see, e.g., 3J). Nonetheless, realistic and well- 
motivated models have been identified where the competition between small production 
rate via qq and large decay rate into electroweak bosons can roughly compensate. A very 



3 It is possible in such composite scenarios that the Higgs boson proper, the neutral CP-even scalar com- 
ponent of the doublet, is not a well-defined particle. This is the case in Technicolor and other Higgslcss 
theories. The decays of the Z' may then become completely dominated by W + W~ (in close analogy 
with p° — > 7r + 7r~ in QCD), or by other light composites. As we ultimately perform a model-independent 
analysis in this paper, in particular an independent study of W + W~ , our results can also be applied to 
these cases if m z > > 1 TcV. 
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well-studied class of such models are the warped 5D constructions inspired by the Randall 



Sundrum (RS) model 



481 ] . with the SM fields propagating in the bulk 
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50( . According 



to AdS/CFT duality |53|, these can be interpreted as weakly-coupled holographic duals of 



large- N c 4D fie 
the TeV scale 



d theories at very strong coupling and which exhibit near-conformality above 



54. 



55]. 



As an illustrative example, we briefly review the minimal custodial RS model with bulk 
51 ]. (A more detailed review of the Z' phenomenology of this model appears in Q].) 



SM fields 



Implementing a gauged custodial symmetry in the bulk (the holographic dual of a global 
custodial symmetry of the strongly-coupled sector) prevents large shifts of the electroweak T 
parameter, and can be modified to better accommodate measurements of Z — > bb 56j. The 
complete bulk gauge symmetry of the electroweak sector is SU{2) L x SU{2) R x U(1)b-l, 
and boundary conditions are chosen so that only the SU{2)l x U{1)y subgroup has zero 
modes (which subsequently acquire mass through the usual Higgs mechanism). The higher 
KK excitations sample from the complete gauge group, with each KK level containing two 
W bosons and three Z' bosons. All of the bosons have nearly degenerate masses set by the 
5D geometry, and can be highly mixed after EWSB. 

In this scenario, the couplings of the Z' bosons to the SM fermions and the Higgs doublet 
have a natural geometrical interpretation. The couplings are determined by the overlaps 
among the wavefunctions along the fifth dimension. In order to solve the hierarchy problem 
as in the original RS, the Higgs profile is localized near (or confined to) the IR brane. Light 
fermions are localized near the UV brane, leading to small overlap with the Higgs, and 
therefore small 4D Yukawa couplings. Heavy fermions such as the top quark are localized 
toward the IR brane. The gauge zero modes are uniformly spread throughout the space, but 
their KK excitations are localized near the IR brane. Consequently, the couplings of Z' to 
light fermions are suppressed, whereas the couplings to the Higgs doublet (and top quarks) 
are enhanced. As the region near the UV brane represents non-composite physics in the dual 
description, whereas the bulk and the IR brane represent composite physics, the tail of the 
KK wavefunctions near the UV brane indicates the kind of non-composite/composite mixing 
that occurs between the photon and p° in QCD. This mixing can be significant enough to 
allow reasonable production rates in qq annihilation. 

However, mixing between the composite sector and light SM sector also leads to tension, 
as there will be associated shifts in the values of electroweak and flavor observables. In order 
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rom the S parameter, the mass of the first 
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581 ] . Incorporating constraints from 



to satisfy electroweak constraints, coming main 
gauge KK level should be at least 0(3 TeV) 
flavor and CP can push the mass even higher [59j. These observations tend to generalize 
beyond this specific model, especially regarding electroweak corrections. The implication is 
that if the Higgs is composite, then the physics associated with that compositeness may be 
hiding at the multi-TeV scale. For this particular model, a 2(3) TeV Z' can be produced 
in qq annihilation with a x BR(Zh or W + W~) of roughly 15(1.5) fb at a 14 TeV LHC fl. 
(This is, in fact, the dominant production mechanism.) 

While we have so far discussed two extreme cases, namely simple gauge U(l)' models and 
models where the Higgs and Z' are both composite particles, it is also possible to interpolate 
between the behavior of these two. A well-known example is the Little Higgs scenario j52 | 



(reviewed in (4]). Here, the Higgs is a composite pseudo-Goldstone boson arising from the 
breakdown of a partially-gauged global symmetry, and inherits special protection from one- 
loop mass corrections. The gauge sector contains two copies of 577(2) x £7(1) as a subgroup, 
of which our own electroweak group is a mixture. The orthogonal gauge group contains two 
Z' bosons. Unlike the Higgs, they are now elementary, but they becomes massive due to the 
dynamics of the strongly-coupled sector that generates the Higgs. The relative couplings of 
these Z' bosons with the Higgs and with the SM fermions depends on the mixing angles 
between the two copies of the gauge group, and can be continuously dialed. 4 Similar to 
the warped 5D models, electroweak constraints favor TeV-scale masses or higher for the Z' 



bosons of the Little Higgs (see, e.g., [60 



, |6jJ). To get a sense for what cross sections are 



possible, we cite the numbers for the "Littlest Higgs" [62[ (with cot^ = 0.5) studied in [28|: 
10(1.1) fb for m z > = 2(3) TeV at a 14 TeV LHC. 

In summary, we emphasize that many well-motivated models contain one or more Z' with 
appreciable coupling to the Higgs doublet, and therefore appreciable branching fraction into 
electroweak bosons. Such Z' are constrained by electroweak precision tests (as well as flavor 
measurements) to live in the mufti- TeV mass range. Through the analog of 7-p mixing, it 
is possible to produce these resonances at the LHC through qq annihilation, even if the Z' 
is a composite or belongs (mostly) to a gauge group that is decoupled from the light SM 



4 While the Z' bosons here represent simple broken gauge symmetries, the branching fractions into fermions 
and Zh/W + W~ can escape the constraint of equation ((2|) since the Yukawa couplings in these models 
contain spurion fields to balance the charges. 
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fermions. In the case that the Z' is a simple elementary gauge boson, we usually expect 
the branching fractions into electroweak bosons to be smaller than the branching fractions 
into fermions. However, even if this turns out to be the case, verifying that the rate into 
electroweak bosons is either absent or subleading would be very useful to test the gauge-like 
nature of the Z '. 



B. Possible final-state decay channels 

While we have been discussing only two options for the initial decay stage of the Z' 
(roughly equal rates into Zh and iy + iy _ ), the secondary decays of the electroweak bosons 
lead to a proliferation of possible final states. Matters are particularly complicated by the 
fact that we do not yet know the mass of the Higgs boson, nor even if its branching fractions 
for a given mass strictly follow those predicted by the SM. 

We can start by categorizing the final states from the well-understood W + W~ case. 
The available topologies (identical those well-known from tt) are dileptonic, Z+jets, and 
all-hadronic. (Dileptonic and Z+jets include cases with taus, though we do not explicitly 
investigate these here.) The dileptonic channel (5% BR) is in principle the cleanest, but 
suffers from low branching fraction and ambiguities in kinematic reconstruction. It was 



analyzed in js| and found to be less sensitive than Z+jets. Given this, and given that the 
current paper is focused on jet substructure applications, we do not re-analyze this channel. 
The Z+jets channel (30% BR) was also studied in js] (as well as in j29|), and was shown to 
be the most powerful. We investigate the search reach in this channel in subsection IIV Al 
Finally, the all-hadronic channel (45% BR) has high rate but faces overwhelming dijet 
background. Even applying two hadronic H^-tags to the event (using the substructure 
techniques of the next section) is not enough to combat this to the point where all-hadronic 
achieves a reasonable S/B ratio. 5 Therefore, we do not investigate it. 

The Zh case offers significantly more options. We can globally classify these using the h 



5 Since the Z+jets and all-hadronic branching fractions are comparable, all-hadronic can be competitive only 
if the single-tagged W+jet — » (/z/)+jet and double-tagged dijet background rates are also comparable. 
Equivalcntly, the ratio between the untagged background cross sections (W+jet/dijet) should be of order 
the W mistag rate or larger. We estimate this ratio of cross sections to be only 0.03% in our kinematic 
range of interest, more than two orders of magnitude too small, given the mistag rates we estimate in 
subsection IIII CI 



s 



decay mode, and more locally by also specifying the Z decay mode. 

The simplest option, and the one favored by a light SM-like Higgs (70% BR for rrih = 120 
GeV), is h — > bb. This two-body decay was explored in the highly boosted case in jf^O], and 
in the intermediate-boost case in js|. The bb system will be accompanied by the decaying 
Z, which can decay to Z + /~ (/ = e,\i, 7% BR), neutrinos (20% BR), t + t~ (3% BR), or jets 
(70% BR). The usual logic is to focus on the cleanest possible final-state: (l + l~)(bb), usually 
with one or two 6-tags. We will analyze this mode anew in subsection IIVB| applying jet 
substructure and replacing vertex-based 6-tagging, which performs uncertainly at high-py, 
with /i-based tagging or no tagging at all. 6 The invisible Z should be similarly clean, but has 
not been studied in detail before at high-pr- (It is mentioned in 6|, and studied explicitly 
at intermediate-boost in jg].) While complete kinematic reconstruction is impossible, the 
Z' mass can be approximated event-by-event by using 2pr(h). This has a Jacobian peak 
at rriz', which is further reinforced by spin terms in the decay matrix element. While the 
resonance peak here is inevitably broader than in the fully reconstructed leptonic case, and 
is therefore subject to higher background, this interplays nontrivially with the three times 
higher statistics. We study this mode in subsection IIV CI Taus we will not consider, as they 
are challenging and have very small rate. (However, we return to them in the context of 
Higgs decays below.) Finally, the case with a jetty Z faces the same kind of overwhelming 
background problem as all-hadronic W + W~ , and is at best extremely challenging. 7 We do 
not pursue it here. 

This rounds out the set of modes which we explore in this paper: (lv)(qq'), (l + l~)(bb), 
(ui>)(bb). We will pursue two other important o ptio ns for Higgs decays, h — > t + t~ and 
h — > WW^ /ZZ^*\ in a set of companion papers 



25. 



26|. 



III. JET SUBSTRUCTURE FOR BOOSTED ELECTROWEAK BOSONS 

The electroweak bosons produced in the decay of a multi-TeV Z' will be highly Lorentz- 
boosted. Consequently, their own decay products tend to become highly collimated into 

6 While we do not explicitly exploit this, our no-tagging analysis will also capture the signal from h — > gg. 

7 We might ask whether it could be recovered by demanding two hadronic electroweak boson tags and two 
6-tags on the Higgs side. If the 6-tag operates perfectly (100% tag, 0% mistag), this may not be so terrible. 
However, it seems unlikely that this mode can be made competitive with more realistic 6-tag numbers. 
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single "jets" — W^-jets, Z-jets, and /i-jets. On the one hand, this collimation can complicate 
searches, as these boson-jets can easily be confused with QCD-induced jets. E.g., the mode 
Z' — > WW — > (lv)(qq') will suffer from large, quasi-two-body W^+jets backgrounds Wq 
and Wg. Even modes with leptons within a jet (such as from boosted h —> t + t~ — > IX), 
will face backgrounds from ordinary jets containing bottom/charm production, lepton fakes, 
or VF/Z-strahlung. On the other hand, the internal kinematic configurations of boson-jets 
are still rather distinctive from QCD, and we can capitalize on this to improve discrimi- 
nation. Indeed, a variety of strategies have been developed to discriminate hadronic (or 
semi-hadronic) decays of boosted heavy particles from ordinary QCD jets utilizing jet sub- 
structure, i.e. the detailed distribution of particles and/or energy within a jet. In this 
section, we detail in what sense substructure techniques are beneficial for discriminating 
boosted W/Z/h decaying to two quarks, and how robust these techniques are to contamina- 
tion and detector effects. (We discuss new techniques which we develop for the even more 



distinctive case of boosted h — > t + t in 



25|, and boosted h WW^/ZZ^ in |26|.) 



The section is organized as follows. We first describe the basic utility of substructure 
methods and describe our nominal choice of algorithm. We then make a comparison with 
a more traditional jet-based analysis to further clarify the advantages of substructure. Fi- 
nally, we estimate the performance of boosted electroweak boson tagging, and discuss the 
robustness of our estimates against theory and detector modeling. 



A. Utility of substructure and the BDRS algorithm 

The simplest jet observable sensitive to substructure is the jet's mass. While the decay 
products of the boosted electroweak boson may end up within a single jet, this jet will still 
have a mass close to the electroweak boson mass, and the Z' resonance will show up as a 
localized bump in the joint distribution of event mass and jet mass. For backgrounds with 
QCD jets, the event mass and the jet mass will both tend to be peaked toward zero. 

Taking an even closer look at the energy distribution within a jet offers two immediate 
improvements. First, ordinary QCD jets with mass near, say, my/-, will preferentially acquire 
their mass via relatively soft emissions near the edge of the jet area (depending in detail 
on the jet definition and pt scale). This situation is in contrast to genuine electroweak 
bosons, which usually consist of two well-localized subjets with roughly democratic energy- 
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sharing. Substructure techniques 



a a 



12Ml5l] can identify the more QCD-like kinematic 



configurations and remove them from analysis, thereby improving signal-to-background. 
Second, by identifying the subjets corresponding to the two decay partons, we can restrict 
our boson reconstruction to include only the particles (tracks and calorimeter cells) near 
their cores, thereby ignoring wider-angle radiation in the jet. This tends to be dominated 
by uncorrelated sources, namely ISR, underlying event, and pileup. By refining the set of 
particles to those in the immediate vicinity of the subjets, the jet mass resolution can be 
optimized. 8 

To take advantage of both of these improvements, we focus here on the sub jet-finding 
technique developed by Butterworth, Davison, Rubin, and Salam (BDRS) js| and further 
investigated by ATLAS Our modes Zh — > {l + l ){bb) and Zh — > (vv){bb) were also 

studied in those papers (as well as the closely related Wh — > (lu)(bb)), at lower boosts and 
in the context of a Standard Model Higgsstrahlung search. The BDRS technique trivially 
extrapolates to boosted hadronic Ws and Zs, as well as to arbitrarily high energies, limited 
only by the detector's spatial resolution (a topic which we address in appendix IB1 . 

The algorithm runs as follows. An event is first iteratively clustered into large-radius 



"fat-jets" with the Cambridge/Aachen (C/A) jet algorithm [65|, |66|. After a fat-jet of in- 
terest is identified, it is iteratively declustered by working backwards through the clustering 
stages. At a given stage, a protojet j is split into progenitors ji and j 2 . The splitting is 
categorized by a fractional mass-drop measure max(mi, m 2 )/m 12 and a symmetry measure 
min(py 1 ,py 2 )Ai?f 2 /mf 2 . The algorithm stops, identifying two subjets, when the former is 
less than a threshold /i = 0.67, and the latter is greater than a threshold y cut = 0.09. If these 
criteria are not met, the more massive of j\ and j 2 is used as the j for the next iteration, 
and the less massive is thrown away. The algorithm continues searching down to arbitrarily 
small Ai?s and arbitrarily small protojet masses, until either suitable substructure is found 
or there are no more particles left to operate on. 

While we will be applying this algorithm in a new kinematic regime, we do not modify 
its original declustering parameters. Partially this is to maintain contact with the earlier 
literature, but more practically we do not find dramatic gains in signal/background discrim- 



A number of promising techniques have also been developed to s pec ifically clean a jet of uncorrelated 
radiation, independent of the identification of localized subjets 
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ination by changing parameters. Indeed, the fairly scale-free nature of the procedure leads 
to similar behaviors at all scales. We do note that after completing our studies with the 
original BDRS algorithm, we found that the mass-drop criterion appears to play almost no 
role. For all of our analysis samples, it is almost always strictly weaker than the symmetry 
criterion. 9 Of course, a more realistic study should more closely explore optimization of the 
procedure and its parameters. 

For our clustering radius, we use R = 1.4, corresponding to quasi-hemispheric fat-jets. 
This may seem unnecessarily, even counterproductively, large for a kinematic region where 
the interesting structure becomes smaller than a normal jet. However, we have found that 
this very large jet radius offers a number of advantages. First, choosing a jet radius based on 
the known ratio m\y/pT or the expected ratio ruh/pr would introduce a hidden dimensionful 
scale into the analysis, manifesting itself as a turn-off feature beyond mw or mh in the 
background's jet-mass spectrum. Choosing a large R pushes this feature away from the 
physical boson mass, flattening out the sidebands. Second, using a large R broadens the 
range of Z' masses over which our search could operate efficiently, down to mz> = 300 ~ 400 
GeV. Third, as discussed in subsection IIII CI using such a large fat-jet radius picks up on 
more global event information, which appears to be useful for discriminating electroweak 
boson jets from QCD jets. 

After finding subjets, BDRS further refine them using a technique called filtering. The set 
of particles constituting the two subjets found by the mass-drop/asymmetry procedure are 
reclustered with C/A, using a refined jet-radius parameter i?mt = min(0.3, Ai? su bj C ts/2). The 
hardest three of these refined subjets is kept and used to reconstruct the boson. Filtering has 
been shown to be important to achieve optimal mass resolution at moderate boost, where 
the initial two subjets can be quite large and quite contaminated by the underlying event 



and pileup (see (67| for a detailed study). As we transition to high boost, these subjets 
become smaller, and the effects of contamination quickly become much less dramatic. For 
simplicity, we do not use filtering in the present study, nor do we introduce pileup into 



9 Analyses run with the mass-drop criterion turned off yield kinematic distributions (both shape and nor- 
malization) for signal and background subjets that are nearly identical to analyses run with the mass-drop 
turned on. However, running with the mass-drop but without the symmetry criterion leads to significant 
reshaping of background, and in particular enhances the number of background events in the electroweak 
boson mass windows by a factor of 2 ~ 3. 
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our simulation samples. We separately address the utility of filtering for removing pileup 
(mainly relevant for the 1 TeV samples) in appendix |A] 



Before proceeding, we also point out the possible utility of jet "superstructure" [68] 
for discriminating boosted electroweak bosons from QCD jets. The shower products of a 
boosted electroweak boson tend to be very well-collimated, at the AR scale of the two 
daughter quarks, as there is no color connection with the rest of the event. QCD jets, on 
the other hand, can shower at all angles, often with bias towards one of the beams. This 
can lead to differences in the large-angle energy patterns. While simple measures of this 



difference lead to only modest discrimination 68|, [69 U_i t is possible that a more sophisticated 
analysis could yield a more powerful discriminator 70[. We will see in subsection IHI CI that 
our choice of quasi- hemispheric jet clustering already incorporates some aspect of this more 
global discrimination. 

B. Comparison with traditional jet analysis 

We attempt to further clarify the utility of the BDRS procedure in heavy Z' searches 
by performing a simple particle-level comparison with a more tradition al j et-based analysis. 



Analyses of this latter type have previously been investigated in [6|, |28|, 29]. We use for 
this comparison events from our Z' — > WW — > {lv){qq') signal and combined W(q/g) 
background. 10 Our BDRS-style analysis operates on the hardest R = 1.4 fat-jet found in 
each event. The traditional jet analysis clusters the event into R = 0.4 anti-fcr jets. To make 
the two analyses as similar as possible, in the traditional analysis we first find the hardest jet 
and then seek out the next-hardest jet within R — 1.4. If the pairing is not too asymmetric, 
in the sense of the BDRS y cut , these two jets are kept as the "subjets" constituting the 
hadronic W candidate. (As in the substructure analysis, the mass-drop criterion is almost 
always satisfied.) Failing this, it is assumed that the W decay products have merged into 
one jet, and only the initial hardest jet is kept. In both analyses, we only consider relatively 



10 Details of the simulations and reconstruction can be found in section llVl though there are some important 
differences in the present analysis. No detector model is applied, and particle energies are not smeared. 
(Including these effects would not significantly alter our conclusions.) Also, unlike the full WW analysis 
in section IIV1 a jet veto is not applied here, as the sample of jets in the two analyses are very different, 
and we do not consider top backgrounds. 
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m z . = 1 TeV 



m z . = 3 TeV 




m™° (GeV) m™° (GeV) 

FIG. 1: Distributions of the (particle-level) reconstructed hadronic W mass for 1 and 3 TeV Z' — )• 
WW — > (lv)(qq'). Displayed are the nominal BDRS procedure (solid black), traditional dijets 
(dashed blue), traditional monojets (dashed red), and combined dijet+monojet (solid purple). 



central hadronic W candidates, with \rj\ < 1.5. 

The hadronic W mass reconstruction is shown in Fig.[T]for 1 and 3 TeV Z' . It is clear that 
the impact of switching to a traditional analysis is modest. Three features are worth noting. 
First, at 1 TeV we see a healthy admixture of traditional dijet and monojet reconstructions, 
whereas at 3 TeV monojet (unsurprisingly) dominates. Second, the traditional analysis is 
somewhat more efficient (by about 1.5% at 1 TeV and 7% at 3 TeV), due to the lack of 
internal phase space cuts in the monojet mode. Third, the 3 TeV monojet sample shows 
some broadening due to underlying event contamination, since the separation between W- 
subjets is becoming small compared to the jet radius. 

The backgrounds are shown in Fig. expressed as slices of the double-differential cross 
section in the (m^> co , m™°) plane at m^, co = 1 and 3 TeV. Again, 1 TeV events reconstructed 
near the W mass are a mixture of traditional dijet and monojet, and each has a very distinct 
shape. In the W mass region, the sum is nonetheless quite comparable to the BDRS result. 
At 3 TeV, the traditional analysis is again dominated by monojet. However, the effects 
of the internal kinematic cuts of the BDRS procedure are now evident, as the background 
processed through BDRS is roughly two times smaller, and significantly flatter. 

While the advantage of BDRS over more traditional jet analysis is very clear in the 3 TeV 
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FIG. 2: Slices of the showered leading-order double- differential cross section for W +jets — > 
{lv)+jets vs (particle-level) reconstructed W mass, fixing the reconstructed Z' mass at 1 and 3 
TeV. Displayed are the nominal BDRS procedure (solid black), traditional dijets (dashed blue), 
traditional monojets (dashed red), and combined dijet+monojet (solid purple). 



background, the 1 TeV background illustrates a more subtle potential advantage. In the 
traditional analysis, we inevitably face the issue of combining dijet and monojet analyses. 
There is a broad crossover region between dijet-dominance and mono jet-dominance as we 
scan through potential Z' masses. 11 In contrast, the BDRS procedure is largely free of any 
distance or energy scales beyond the fat-jet clustering size. It operates smoothly and (mostly) 
without dimensionful thresholds. This feature may be advantageous, both in simplifying 
the analysis and in avoiding possible systematic effects on the background shape in the 
(m^> co , m^, co ) plane. The relative flatness over m^ co may also be seen as an advantage. The 
background processed by BDRS is only steeply falling along the m^, co direction, possibly 
allowing a signal bump to stand out more clearly in the mass plane than in the more 

11 The detailed behavior of this crossover depends on the jet definition and cuts. For example, when we 
focus on the region m^, co = 1 TeV, the reconstructed Ws will have pr ~ 500 GeV. In our dijet analysis, 
the minimum AR is 0.4, and the minimum y is y cu t = 0.09. The minimum dijet mass can be roughly 
approximated as \/0.09(0.4)(500 GeV) = 60 GeV, which corresponds reasonably well with the left panel 
of Fig. [5J Note that changing (or removing) the symmetry cut will simply move the location of the 
crossover. Taking a larger R for the jet is another option. However this subjects the reconstructed Ws to 
more contamination, and is less discriminating against highly asymmetric QCD configurations. 
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FIG. 3: Reconstructed W-jet (left) and h-jet (right) invariant mass distributions for Z' masses of 
1 TeV (black), 2 TeV (blue), and 3 TeV (red). The rescaled ECAL model of appendix W\ has been 
applied. 
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59 
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TABLE I: Tag rates (in percent) for boosted electroweak bosons. (Absolute statistical errors are of 
order 2%.) 

traditional analysis. 



C. Performance and robustness 



We now categorize the performance of the BDRS procedure for discriminating boosted 
W/Z/h. To make our estimates somewhat more realistic, we apply a simple detector model, 
which is described in detail in appendix [B] This model hybridizes ECAL spatial resolution 
with ECAL+HCAL energy measurements, and introduces rudimentary energy smearing. 
The simple trick of using the ECAL to trace the energy flow significantly decreases the 
impact of detector granularity at very high p?. Event reconstruction follows section IIV1 

For the signal, we use VF-jets and (non-fe-tagged) /i-jets reconstructed from the Z' — > 
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px ^ 500 GeV 


p T ~ 1000 GeV 


px — 1500 GeV 


quark - 


> W 


6.5% 


6.5 


5.9 


quark - 


-> h 


b.8 


o.b 


5.8 


gluon — 


> W 


10.4 


8.3 


7.4 


gluon - 


■» h 


10.5 


8.8 


7.4 



TABLE II: Mistag rates (in percent) for quarks and gluons faking boosted electroweak bosons. 
(Absolute statistical errors are of order 0.3%.) 

WW —> (lu)(qq') and Z' — > Zh — > (l + l~)(bb) samples, respectively. (We do not present 
explicit numbers for Z-jets, but these will be very similar to W^-jets.) The boosted bosons 
produced in the Z' decays are strongly peaked toward px = m Z '/2, and are usually quite 
central. The tag rate is defined as the fraction of reconstructed events where the summed 
boson subjets fall within a prespecified mass window chosen by eye: m su bj e t s = [65, 95] 
GeV for Ws, and m su bj e ts = [100, 140] GeV for Higgses. 12 The reconstructed W and Higgs 
mass distributions are shown in fig. [3J and tag rates are displayed in table [D Qualitatively, 
we can see that the rates are 0(1), are weak functions of px, and that the Higgs rates 
tend to be smaller. This last feature comes from the fact that the higgs is scalar, versus a 
longitudinal spin-1 boson, and that the mass peak is slightly degraded by neutrinos from 
semileptonic bottom and charm decays. It is also worth noting that the the tag rates for 
transverse W would be smaller than those for longitudinal, though we have not investigated 
this quantitatively. 13 

The mistag rates are displayed in tableHIl for Z(q/g) backgrounds generated in p? bins of 
[400,600] GeV, [800,1200] GeV, and [1200,1800] GeV. (W(q/g) simulations yield identical 
results.) These rates are also fairly weak functions of px, and are similar for our W and 
Higgs mass windows. Quarks fake heavy bosons less often than gluons, as their probability 
for radiating is smaller. The final numbers are about 6-7% for quarks, and 7-11% for gluons. 

12 Sometimes a boosted boson will fail reconstruction because the BDRS procedure deconstructs its fat-jet 
down to a single calorimeter cell. We have checked that this is very rare for signal, at the few percent 
level at most. It can be more important for QCD fat-jets, which tend to have less substructure. The loss 
rate in that case is still typically less than 10%. Effectively, these represent an additional contribution to 
the zero-mass bin which we have not accounted for. 

13 Determining this would be important, for example, for W bosons produced in decays of spin-0 or spin-2 
resonances. 
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Interestingly, the mistag rates show a decreasing trend as we increase p?, particularly for 
gluons. This may be indicating some of the global color effects mentioned in subsection IIIIA1 
In particular, our very large fat-jet size will pick up FSR emissions at wide angles, which are 
present for QCD jets but not for electroweak boson jets. This hardens the background jet- 
mass spectrum, sending some fraction of background events beyond the electroweak boson 
mass windows. To get a better sense for what this implies, we re-ran the W-tag analysis 
on the mz' = 3 TeV WW signal and pt = [1200, 1800] backgrounds with R = 0.3. This jet 
size is just large enough to contain the subjets from almost all of the signal events within 
our W mass window, given the value of y cut . We find that the signal efficiency increases 
slightly, from 72% to 74%. This represents some recaptured events, which were lost in the 
quasi-hemispheric analysis due to high-mass combinations where one subjet is the entire 
boosted W and the other is an ISR jet. (The original fraction of such "bad" combinations 
is at the 5% level for all Z' samples.) However, the quark and gluon mistag rates increase 
by a factor of roughly 1.7, to 9.4% and 13.0%, respectively. Optimization of the fat-jet 
and sub jet-finding procedures to best capitalize on this physics is an interesting topic which 
we defer to future work. Still, it is already clear that using smaller jets at higher pt may, 
counterintuitively, be detrimental. 

We have also investigated the theoretical robustness of these mistag rate estimates by 



comparing our results against samples generated with HERWIG v6.510 71fl and MadGraph/ 



MadEvent+PYTHIA v4.4.44 



721 ] with jet-parton matching. The HERWIG samples are of iden- 



tical composition to the default PYTHIA samples. They display no statistically significant 
difference in any of the mistag rates. The MadGraph sample consists of Z+jets events MLM- 
matched [73} up to three jets using p T and AR thresholds of 30 GeV and 0.2, respectively. 
We have generated the pt = [400, 600] GeV bin, and find rather good agreement with the 
PYTHIA and HERWIG results. 14 The mistag rates from MadGraph do appear to be 10-20% lower, 
relatively speaking, but this would have to be confirmed with higher-statistics running. 
Detector modeling (see appendix |Bj) is largely unimportant for the mistag rates given 



14 When working at higher-order it becomes somewhat ambiguous whether a jet originates from a quark 
or a gluon. As an operational definition, we search out hard progenitor partons within AR = 1.4 of the 
main fat-jet axis, tallying individual (signed) quark flavors. Jets with leftover quark-number are deemed 
"quark jets," and are otherwise deemed "gluon jets." This procedure yields relative "Zq" and u Zg" cross 
sections quite comparable to what is obtained with simple 2 — > 2 matrix elements. 
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fixed jet-mass windows, as the QCD spectrum processed through BDRS is not sharply fea- 
tured. We have explicitly checked that eliminating the calorimeter geometry model and/or 
the energy smearing has no significant effect. The jet-mass distributions for signal are of 
course much more sensitive. Practically, we can parametrize our ignorance with a single 
Pr-dependent factor: the amount by which we have mis-estimated the jet-mass resolution. 
If a more realistic estimate tells us that we must expand our mass windows by a factor x 
to accept the same signal, then our mistag rates will increase by approximately the same 
factor (since the background spectra are fairly flat). Signal significance will scale like 1/y/x, 
and signal-to-background like 1/x. Assuming that x is an 0(1) number, then our estimates 
for Z' reach should also be good to 0(1). 



IV. Z' SEARCHES 



In this section, we estimate the discovery reach for Z' resonances decaying to W + W~ 
and Zh, applying the jet substructure techniques in section IIHI As discussed in section [Til 
Z+jets is the most promising, final state for W + W~, and we investigate this mode. This has 
previously been studied in p, |29|. For Zh, we focus exclusively on the case of a light Higgs 
(120 GeV) decaying to bb. The channel with associated leptonic Z decay has traditionally 
been considered the most promising jf], l^sj] (see also 31]). and we also investigate this mode. 
The channel with invisible Z has been mentioned in p], but not investigated in detail. 
This is the final mode which we study here. Perhaps not without coincidence, these three 
topologies — W + W~ -> {lv)(qq') } Zh -»■ (l + l~)(bb), and Zh ->■ {vv){bb) — are nearly the 
same as those studied in [8[, though with Wh replaced by WVy. 15 

The Z' samples for this study are generated with 2— >4 matrix elements in MadGraph/ 
MadEvent v4.4.32 72[ and showered/hadronized in PYTHIA (plugin version 2.1.3). The 
Z' width is taken to be 3%, which is smaller than instrumental width and the signal windows 
that we use below. (Our estimates should work reasonably well for resonances up to about 
15% natural width.) Our background samples are generated with PYTHIA v6.4.11 [t4 | 



15 We note that our results for WW can be translated with minor modifications to the case of W — > Wh — > 
(lv)(bb) without 6-tags. Our non-tagged Zh results can also be translated for searches for ZZ — > (l + l~)(qq) 
and/or ZW — > (l + l~){qq'), though some care should be taken in accounting for spin effects. 
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with default settings. 16 All processes are evaluated at leading-order. 17 As discussed in 
appendix [Aj we do not incorporate pileup into the analysis, but have checked that its effects 
would be quite manageable. The final-state particles are processed through a primitive 
ECAL/HCAL hybrid calorimeter model, described in detail in appendix [Bj to capture the 
main spatial resolution effects which we face in reality. The resulting calorimeter cells are 
then clustered/declustered using Fast Jet 2.4.1 ^7^. Energy smearing is applied to the 
final reconstructed objects (leptons and subjets). Subjet energy smearing is described in 
appendix [HI Electrons are smeared by 2%, and muons by (5%)v/£/TeV. 

We estimate discovery reach for 1, 2, and 3 TeV Z' by using simple counting. For each 
sample, we construct a box in the (m^/h' m v ^? ) plane, centered on the signal. The box is 
not optimized, other than coarsely by eye. We claim that discovery is possible if two criteria 
are met: N s /^/7^ > 5 and N s > 10. 

Several of our reconstruction criteria are common between all of the analyses. We accept 
leptons (e and fi) with p T > 30 GeV and \r]\ < 2.5. The leptons are required to be isolated 
from surrounding hadronic activity (neglecting photons and other leptons) within a cone 
of R = 0.4: Pt(0/(pt(0 + pr(hadrons)) > 0.9. Jets are constructed, as described in 
section [TTTl using the Cambridge/Aachen algorithm with R = 1.4. We consider jets with 
Pt > 20 GeV and \rj\ < 2.5. The leading jet, which we take to be the hadronic electroweak 
boson candidate, must have pt > 200 GeV and \rj\ < 1.5. (The last requirement forces the 
candidate into the better-segmented barrel region of the detector.) This jet is declustered, 
and the two subjets used to reconstruct the boson. 



A. W + W in Z+jets mode: (lv)(qq ! ) 

Our basic selection cuts are as follows. We demand exactly one isolated lepton, which 
should be located in the semicylinder opposite the hadronic W candidate jet. We model 
missing energy by taking the transverse vector that balances the (energy-smeared) lepton 



16 pythIA accounts for spin effects in the electroweak boson decay distributions for these backgrounds. 
However, they would not be accounted for in Z' xlficays. 
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The NLO QCD X-factor for the Z' signal is 1.3 [75|. The QCD ^-factor for high-p T W/Z+jets, which is 
our dominant background in all cases, is 1.6 The net effect of these corrections on S/y/B is therefore 
small. Electroweak corrections for high-pr processes can also be significant 77 1- These tend to reduce 
the W/Z+jets rate. 
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FIG. 4: Reconstructed Z' invariant masses for 1, 2, and 3 TeV in WW —> {lv){qq'). 
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13.0 fb 
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a(tt) 




23 fb 


0.57 fb 


0.011 fb 


a(WW) 




3.77 fb 


0.33 fb 


0.042 fb 


a(WZ) 




1.01 fb 


0.068 fb 


0.0068 fb 


a(Wh) 




0.029 fb 


0.0017 fb 


0.00013 fb 



TABLE III: Signal efficiency and background cross sections after all cuts in WW — > {lu){qq'). 
Note that the signal efficiency includes the branching fractions for the W decays. 

and two subjets. We then approximately reconstruct the complete leptonic W boson by 
exploiting the quasi-collinearity of the lepton and neutrino: r] p = r]i. After this step, we can 
construct the complete event mass, m^, co . To reduce VT+jets background, which tends to 
be produced at shallower angles than the signal for a given s, we demand that the hadronic 
W satisfy pt > m^, co /3. Finally, in order to control top backgrounds, we apply a jet veto: 
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Luminosity 


= 1 lev 


mz 1 = 2 lev 


mz' = <j lev 


£ = 30 fbT 1 


58.3 fb 


9.02 fb 


2.70 fb 


£ = 100 fbT 1 


31.9 fb 


4.94 fb 


1.48 fb 


C = 300 fbT 1 


18.4 fb 


2.85 fb 


0.85 fb 


S/B = 1 


501.5 fb 


15.44 fb 


1.36 fb 



TABLE IV: a(Z') x BR(Z' -> W+VF") required for discovery or S/B = 1 in the (lv)(q?) mode. 
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FIG. 5: Discovery reach for WW — > (lv)(qq f )- The three symbols (diamond, box, circle) correspond 
to (30, 100, 300) fb -1 of integrated luminosity. The x refers to the value that is required to achieve 
S/B = 1. 



1000 - 




the subleading jet in the event (if found) should satisfy px < pr(0/2- 18 The reconstructed 
Z' resonance, after application of all cuts, is shown in figure HI 

Our signal box is m^? = [65,95] GeV and m^, co = [mz> — 15%, mz' + 15%]. The final 
signal efficiencies and background cross sections are shown in table IIHI The discovery reach, 
i.e. the minimum Z' cross section times BR{W + W~) required to claim discovery, is listed 
for integrated luminosities of 30, 100, and 300 fb -1 in table HV] and displayed graphically in 
fig We also list /display the a x BR required to achieve S/B = 1. 



18 This is a very simplistic cut, with many caveats. In particular, since the jets we use are very large, pileup 
can significantly increase the subleading jet's momentum. However, even a slightly more sophisticated 
analysis could likely work around this, for example reclustering with a smaller jet size or trimming the 



fat-jet [63(. We suspect that a more dedicated top "anti-tag," for example attempting to fully reconstruct 
and veto the leptonic and/or hadronic top, could perform even better. We also note that a jet veto of 
this type rejects W+jets backgrounds generated in the soft VK-strahlung-like regions of phase space. 
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FIG. 6: Reconstructed Z' invariant masses for 1, 2, and 3 TeV in the Zh — > )(bb) mode. 



As one benchmark to interpret the results, we point out that a 3 TeV Z' with a x BR ~ 
1.5 fb, such as the warped 5D case discussed in section [III can be discovered with about 
100 fb -1 of data (and with 0(1) S/B ratio). Previous estimates had placed discovery at the 
ab^ 1 scale jfi] or higher 29]. 



B. Zh with leptonic Z: (bb) 

For this analysis, we reconstruct the leptonic Z by demanding exactly two isolated leptons 
with opposite-sign and same-flavor, and which fall into the mass window mi+i- = [80, 100] 
GeV. These should be within |A0| < 7r/2 of each other, and the leptonic Z reconstructed 
from them should be | A0| > tt/2 away from the Higgs candidate jet. The Z' is reconstructed 
as a simple four-vector sum of the leptonic Z with the Higgs-subjets. As with the W + W~ 
analysis above, we demand that the Higgs candidate satisfy pt > m^, co /3. Figure [6] displays 
the reconstructed resonance peaks after application of cuts. 

Searches for a light Higgs decaying to bb traditionally rely heavily on 6-tagging. However, 
the quality of vertex-based 6-tagging is expected to quickly degrade at high p T , and we do 
not here have the ability to model this. Nonetheless, we can ask whether a more robust 
form of 6-tagging can be applied, or whether 6-tagging needs to be applied at all. In lieu 
of capitalizing on the displaced vertices of B hadrons, we use the well-exploited fact that 
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mz' = 1 TeV 


mz' = 2 TeV 


mz' = 3 TeV 


Signal Eff. 


1.3% (0.39%) 


1.7% (0.53%) 


1.7% (0.57%) 


a(Zj : ->■ 5Z) 

a(Z(W/Z)) 

a{Zh) 


5.68 fb (0.16 fb) 
1.52 fb (0.09 fb) 
0.093 fb (0.009 fb) 
0.046 fb (0.013 fb) 


0.18 fb (0.0066 fb) 
0.067 fb (0.0053 fb) 
0.0042 fb (0.00011 fb) 
0.0024 fb (0.0007 fb) 


0.014 fb (0.0006 fb) 
0.006 fb (0.0005 fb) 
0.00044 fb (0.000014 fb) 
0.00025 fb (0.000076 fb) 



TABLE V: Signal efficiency and background cross sections after all cuts in Zh — > (l + l~)(bb). Note 
that the signal efficiency includes the branching fractions for the Z and h decays. The numbers in 
parentheses are after \i-tagging. 



Luminosity 


mz 1 = 1 TeV 


mz 1 = 2 TeV 


mz 1 = 3 TeV 


C = 30 fb" 1 
C = 100 fb" 1 
C = 300 fb" 1 
S/B = 1 


186.2 fb (121.9 fb) 
102 fb (66.8 fb) 
58.9 fb (38.6 fb) 

552.6 fb (69.8 fb) 


27.8 fb (62.4 fb) 

15.2 fb (18.7 fb) 
8.8 fb (6.2 fb) 

15.3 fb (2.4 fb) 


19.7 fb (59.0 fb) 
5.9 fb (17.7 fb) 
2.4 fb (5.9 fb) 
1.2 fb (0.2 fb) 



TABLE VI: a(Z') x BR(Z' -> Zh) required for discovery or S/B = 1 in the (l + l~){qq') mode. The 
numbers in parantheses are after /i-tagging. 

they have fairly high semi-leptonic branching fractions, specifically to the very clean case 
of muons. A muon can be produced either promptly from the b quark decay, or from a 
subsequent charm decay, leading to a net BR of about 20% for B — > /i+anything. Since 
the Higgs decay gives us two chances to find one of these decays, our net tag rate for > 1 
muon embedded within the Higgs-subjets should approach 35%, independent oipx- Physical 
backgrounds to this include prompt heavy flavor (b- and c-jets), gluons splitting to heavy 
flavor within light QCD jets, and decays-in-flight of pions and kaons (which we include in 
our PYTHIA simulations). We implement the tag by searching out muons with > 5 GeV 
within cones of size R = Ai? su bj C ts around each subjet. In practice, we find tag rates of 
roughly 30%, and mistag rates at the level of 3-8%, increasing slighly with pt and generally 
somewhat higher for gluons. There will also be backgrounds from fake muons, but we assume 
that these can be brought to a level below the physical mistag rate. 

We run two concurrent analyses, one without a fe-tag and one with the /x-based fe-tag. 
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FIG. 7: Discovery reach for Zh — > (/ + /~)(66) without (left) and with (right) \x-tagging. The three 
symbols (diamond, box, circle) correspond to (30, 100, 300) fb" 1 of integrated luminosity. The x 
refers to the value that is required to achieve S/B = 1. 

The signal box is mj? co = [100, 140] GeV and m r |, co = [m z > - 15%, m z > + 15%]. The final 
signal efficiencies and background cross sections are shown in table |V] The discovery reach 
is listed in table IVI[ and displayed graphically in fig [TJ along with the a x BR required to 
achieve S/B = 1. 

We can see that the statistical reach and S/ B are clearly improved by the //-tag for lower 
masses, but it becomes overkill at higher masses, where the untagged background is only 
0(1) events even at 300 fb" 1 . For 2 and 3 TeV, the tagged discovery reach is essentially 
controlled by the N$ = 10 requirement, with near-vanishing background. Clearly, 6-tagging 
is most important at sub- TeV masses, where the background can still be substantial. For- 
tuitously, sub- TeV masses are exactly where traditional 6-tagging techniques would operate 
well. But we learn here that 6-tagging in this mode at multi-TeV mass may in fact be 
counterproductive, at least from a strictly statistical standpoint. 

Using the same benchmark model as in the previous subsection, 3 TeV and 1.5 fb _1 , we 
find that discovery would require a bit less than 400 fb _1 . This can be contrasted with jsj, 
which required slightly less than 1 ab" 1 . 

C. Zh with invisible Z: (vv)(bb) 



At low pt, processes involving an invisible Z are challenging because the neutrinos are 
nearly back-to-back. In our case, they are nearly collinear, leading to striking monojet 
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FIG. 8: Reconstructed Z' transverse masses for 1, 2, and 3 TeV in the Zh — > ){bb) mode. 



signals with a single TeV-scale jet and almost no other activity. Honestly modeling the 
background in this case can be somewhat subtle, as non-gaussian fluctuations in energy 
measurements can create fake missing energy. However experimental simulation studies 
(see e.g. [3]) indicate that generic QCD backgrounds, as well as top and W backgrounds, 
can be readily controlled. Consequently, we focus exclusively on Z+jets and SM diboson 
backgrounds. 

We use our default analysis cuts, and apply no special cuts to insist on large missing 
energy. A more realistic analysis would require some kind of veto on activity opposite 
the Higgs candidate, but this would still pass the signal with very high efficiency We 
define missing energy as simply the transverse momentum vector that balances the Higgs, 
fl/rp = —p T {h). A more refined measurement would also take into account energy deposited 
by ISR jets, etc. 

Proper reconstruction of the Z' mass peak is impossible, as there is no unambiguous way 
to know the pz of the invisible Z. However, the pt of the Z' decay products has a Jacobian 
peak which is spin-enhanced. In figure [H we show the distribution of the reconstructed Z' 
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m^/ = 1 TeV 


mz' = 2 TeV 


mz' = 3 TeV 


Signal Eff. 


5.4% (1.7%) 


6.8% (2.2%) 


7.1% (2.3%) 


<?(Zj : qg -» qZ) 

a(Zj : qq ->■ grZ) 

<t(Z(W/Z)) 

cj(Z/i) 


43.0 fb (1.54 fb) 
9.2 fb (0.46 fb) 
0.87 fb (0.044 fb) 
0.073 fb (0.025 fb) 


1.42 fb (0.049 fb) 
0.38 fb (0.03 fb) 
0.064 fb (0.0024 fb) 
0.0035 fb (0.00097 fb) 


0.11 fb (0.0058 fb) 
0.037 fb (0.003 fb) 
0.0037 fb (0.00095 fb) 
0.00041 fb (0.0001 fb) 



TABLE VII: Signal efficiency and background cross sections after all cuts in Zh — > (yv)(bb). Note 
that the signal efficiency includes the branching fractions for the Z and h decays. The numbers in 
parentheses are after ^-tagging. 



Luminosity 


mz' = 1 TeV 


mz 1 = 2 TeV 


mz 1 = 3 TeV 


C = 30 fb" 1 
C = 100 fb" 1 
C = 300 fb" 1 
S/B = 1 


123.7 fb (78.6 fb) 
67.7 fb (43.1 fb) 
39.1 fb (24.9 fb) 
987.7 fb (123.8 fb) 


18.2 fb (15.5 fb) 
9.97 fb (6.64 fb) 
5.76 fb (3.83 fb) 
27.2 fb (3.8 fb) 


4.97 fb (14.8 fb) 
2.7 fb (4.4 fb) 

1.57 fb (1.48 fb) 
2.1 fb (0.43fb) 



TABLE VIII: a(Z') x BR(Z' -> Zh) required for discovery or S/B = 1 in the (vv){bb) mode. The 
numbers in parentheses are after ji-tagging. 

transverse mass, 19 

m$%, = ^p T {h) 2 + m{h) 2 + sj^ + m 2 z ~ 2p T (h). (3) 

As a signal box, we take m^ co = [100, 140] GeV and m^f, = [m z > - 30%, m z > + 10%]. 
As in the leptonic Z analysis, we consider cases with/without /i-tagging. The final signal 
efficiencies and background cross sections are shown in table IVIII The discovery reach is 
listed in table IVIIIt and displayed graphically in fig |9l along with the a x BR required to 
achieve S/B = 1. 

19 We also explored two other types of reconstruction, exploiting the fact that the Z' is produced approx- 
imately at rest in the lab frame. We considered the mass formed by assuming that the Z and h are 
produced back-to-back in three dimensions in the lab frame, as well as a mixture between this and the 
transverse mass by using the full h three-vector and the invisible Z transverse vector. Of our three Z' 
reconstruction methods, we find that the transverse mass works the best for all of the physical m z > that 
we consider. 
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0.1 1 0.1 1 

0.5 1.0 1.5 2.0 2.5 3.0 3.5 0.5 1.0 1.5 2.0 2.5 3.0 3.5 

m z (TeV) m z (TeV) 

FIG. 9: Discovery reach for Zh — > iyi>){bb) without (left) and with (right) [i-tagging. The three 
symbols (diamond, box, circle) correspond to (30, 100, 300) fb" 1 of integrated luminosity. The x 
refers to the value that is required to achieve S/B = 1. 

From the perspective of pure statistics, our analysis of Zh with invisible Z suggests that 
it may be even more sensitive than the corresponding analysis with leptonic Z, mainly owing 
to the three times higher event rate (as well as our lack of reconstruction criteria on the 
Z). Of course, this comes at a price of lower S/B, and less distinctive signal shape, as the 
kinematic recontruction is less precise. Still, /^-tagging is no longer as detrimental above 2 
TeV, for example allowing discovery of the 3 TeV warped Z' at about 300 fb _1 with S/B ~ 3. 
We believe that this mode merits a more prominent role for resonance searches at the LHC. 

V. CONCLUSIONS 

In this paper, we have explored the utility of jet substructure methods to aid in the 
search for a Z' decaying to electroweak bosons at the LHC. Our findings can be naturally 
divided into two parts. First, we have made a detailed categorization of the performance of 
the BDRS substructure algorithm for identifying boosted hadronically-decaying electroweak 
bosons in the TeV regime. Second, we have estimated the Z' discovery potential in several 
final-state channels, demonstrating significant improvements over previous analyses. 

In order to clearly justify the use of substructure, we have made a comparison between 
BDRS and more traditional jet-based reconstructions. While the difference is modest for 
genuine boosted electroweak bosons, we isolated several advantages in the treatment of the 
background. For QCD jets with mass near the electroweak boson mass, BDRS achieves 
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0(1) greater discrimination by exploiting the jets' internal kinematics. Beyond this, since 
BDRS is free of artificial AR thresholds, the background mass spectrum is free of artificial 
mass thresholds, and it is also much flatter. 

Another important observation is the stability and benefit of using a large fat-jet cluster- 
ing radius. We chose a radius of R = 1.4, which covers an area much larger than a traditional 
jet, and in particular much larger than a TeV-scale electroweak boson-jet. In some sense, 
we are no longer applying "jet" substructure but rather approaching "event" substructure. 
While the entire event is contaminated with plentiful uncorrelated radiation, we nonethe- 
less find that BDRS is extremely efficient at zooming-in to the most interesting region of 
activity. The scale-invariant picture of the event offered by this procedure offers several 
advantages. As noted above, the background mass spectrum becomes modestly-sloped and 
largely featureless. The large catchment area of the fat-jet allows a single analysis strat- 
egy to interpolate smoothly between resonance searches in the several-hundred GeV range 
up to the multi-TeV range. Finally, a nontrivial amount of global color-discrimination is 
incorporated for free. 

Ultimately, this scale-invariant picture breaks down once we hit the size of the individual 
detector elements. We incorporated into our analysis a primitive detector model which 
highlights this limitation, and suggests how to overcome it. We used the highly-segmented 
ECAL as a tracer of spatial energy flow, and incorporated the full ECAL+HCAL energy 
measurements. While a procedure like this is not so crucial for a 1 TeV Z', it becomes very 
important for 3 TeV, where the quarks from the secondary decay of a W will usually sit in 
adjacent HCAL cells. We do not expect that our model gives a truly accurate picture of the 
jet mass resolution at very high p?, but it does demonstrate that the LHC detectors have 
enough information to resolve the individual quarks. Our resonance search results should be 
largely robust if the true mass resolution can come within 0(1) of our estimate of roughly 
12%. 

For our resonance searches, we explored decays into Zh and W + W~, the former under the 
assumption of a 120 GeV SM-like Higgs decaying primarily to bb. In general, we find much 
better reach than what has previously been claimed. For example, a 3 TeV Z' with fb-scale 
cross section need not be relegated to the Super-LHC. This specific mass has become a kind 
of benchmark for composite/5D Higgs models, as a minimum to pass electroweak precision 
tests. We find that discovery should be possible with as little as 100 fb _1 in the W + W~ 
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mode, with 0(3) times higher luminosity required for Zh to catch up. 

Previous studies of Zh have focused exclusively on the (l + l~)(bb) final state utilizing 
fe-tags. It is well-known that 6-tagging performance degrades at high p?, owing to the 
increasing collinearity of hits in the inner trackers. In order to bypass the complicated 
modeling of this behavior, we have focused exclusively on a muon-based fe-tagger, which 
should extrapolate robustly to high energies. We found that this tagger improves discovery 
reach below 2 TeV, but that it becomes counterproductive at higher masses due to the 
0(1/3) signal tagging efficiency and the small number of expected untagged background 
events at normal LHC luminosities. Of course, it is possible that a more sophisiticated 
high-pT 6-tagger could perform better. 

We have also studied the largely neglected mode (uu)(bb). By reconstructing the Z' 
transverse mass, we find that this channel is highly competitive. The three times higher 
statistics appears to more than compensate for the less precise kinematics. In the case that 
a discovery is claimed of Z' —t Zh, visible and invisible Z modes will serve as important 
cross-checks of each other. 

We conclude that jet substructure techniques are very promising for identifying highly 
boosted electroweak bosons, and that these techniques can be fruitfully applied to Z' 
searches. (Though we have not explored the following point in detail, our methods can also 
be immediately applied to searches for other objects that decay into electroweak bosons, 
such as W and KK gravitons, as well as searches in weak boson fusion.) The scope of our 
analysis was limited to the case of two-body boson decays into quarks. This is adequate for 
W (and Z) bosons, but Higgses clearly present a much richer set of possible hadronic and 
semi-hadronic final states. Identifying these decays will require a broader set of substructure 
tools than what we have implemented here. We relegate their study to future work 25|, \2Q \. 
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Appendix A: Pileup and Filtering 

In the analyses of this paper, we consider Z' masses starting at 1 TeV. In this kinematic 
region, we find that the effects of underlying event contamination within the subjets are 
modest. However, searches extending to multi-TeV Z' masses will only be made possible by 
high-luminosity running, where we inevitably face the issue of approximately 20 min-bias 
pileup collisions superimposed on each Z' candidate event. Determining the best way to 
remove the pileup contamination (and the underlying event contamination at lower m#>) is 
beyond the scope of this paper. However, we do pause here to consider how large the effect 
is and how effective filtering is at correcting it. 

We simulate pileup by adding particles from PYTHIA min-bias events to our Z' events. 
The number of min-bias collisions sampled is drawn, on an event-by-event basis, from a 
Poisson distribution with mean of 20. In principle, charged pileup can be tracked back to 
event vertices distinct from the Z 1 production and subtracted, leaving over an irreducible 
neutral pileup component. On the other hand, the detectors themselves perform part of 
this subtraction "for free," by sweeping away soft charged particles in the magnetic field. 
To coarsely model this, we also consider events where all charged particles below px = 1 
GeV, including those from the primary hard collision, are removed by-hand prior to jet 
clustering. 20 To factorize this discussion from the possible degrading effect of the magnetic 
field on the mass reconstruction (as well as all other detector-related issues, which we discuss 
in appendix [Bj , we work at particle-level without altering the particles' trajectories. 

We study the effect of pileup on the reconstructed Higgs mass in the 1 and 2 TeV Z' — > 
Zh — > (l + l~)(bb) samples. (The WW mode is less sensitive to this contamination at a given 
Z' mass because Ai? su bj e ts tends to be smaller. Partially this is because the W is lighter 
than the Higgs, but also because the W is highly longitudinally polarized.) The results 
are presented in Fig. [TO] 21 The effects of pileup are, unsurprisingly, most pronounced in 

20 The magnetic field and inner HCAL radius of the ATLAS and CMS experiments are, respectively, 2.3m/2T 
and 1.8m/4T. With these parameters, charged hadrons with pr < 0.7 GeV and pr < 1.1 GeV will not 
reach the HCAL. 

Note that the uncontaminated mass peak already displays an asymmetric broadening. This is due to 
physics, not a flaw in the reconstruction. Most Higgs decays through bb contain at least one neutrino in 
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m z . = 1 TeV 



m z . = 2 TeV 




180 



m'° c ° (GeV) m'° c ° (GeV) 

FIG. 10: Distributions of the (particle-level) reconstructed Higgs mass for 1 and 2 TeV Z' — >■ Zh — > 
(l + l~)(bb) . Displayed are results without pileup (black) and with pileup (red). Results are further 
subdivided into: before filtering (solid), after filtering (dashed), and after filtering with a soft track 
cutoff (dotted). 



the 1 TeV sample. The Higgs mass peak becomes shifted upward by about 20 GeV, and 
broadened by a comparable amount. The effect is roughly cut in half by filtering, and cut 
in half again by applying the soft track cutoff. This reduces the shifting/broadening down 
to a level that is very likely below that of the realistic experimental resolution. We have 
also investigated the case with purely neutral pileup (not shown in Fig. [TO]) , and find that 
it looks quite similar to the track-cutoff case. 

The effect of pileup on background reconstruction is more subtle. QCD sub jet configura- 
tions with high mass prefer to be highly asymmetric in energy, and the most extreme cases 
are automatically ignored by the BDRS procedure. However, such configurations can be 
pushed above the symmetry threshold y cut by the addition of pileup particles to the softer 
subjet. This can be particularly problematic with our very large fat-jet size (R = 1.4). Fil- 
tering, even combined with a low-p? track cutoff, is not powerful enough to fully counteract 
this. As an example, we show in the left panel of Fig. [UJthe fn r ^ co spectrum from our Z+jet 

the final state. Some of the broadening is also due to underlying event contamination, particularly at 1 
TeV. Filtering partially corrects this off. 
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FIG. 11: Distributions of the (particle-level) reconstructed Higgs mass for Z+jets — > {l + l~)+jets 
backgrounds generated at y/§ = 1 TeV, without (left) and with (right) a symmetry cut applied after 
filtering. Displayed are results without pileup (black) and with pileup (red). Results are further 
subdivided into: before filtering (solid), after filtering (dashed), and after filtering with a soft track 
cutoff (dotted). (Omitted for clarity are no-pilup with filtering and track cutoff (black- dotted), and 
pileup with filtering and without track cutoff (red- dashed).) Note that the arbitrarily-normalized 
vertical scales on the two plots are identical. 

background generated near y/§ = 1 TeV. 22 After application of the track cutoff and filtering, 
there is a leftover enhancement of the background spectrum near 100 GeV. We find that 
the excess is dominated by the cases described above, but that these can easily be removed 
by insisting that the second-hardest subjet remain fairly hard after filtering. For example, 
imposing a post-filtering symmetry cut (filtered subjet #2 )/pr (filtered jet) > 0.1 is ade- 
quate, as shown in the right panel of Fig. [TTJ Without pileup, this cut is largely redundant 
with y cut and has small effect. The effect on the Z' signal with pileup is at the 1% level. 

22 For the purpose of this study, these events are processed without 6-tagging and without isolation require- 
ments on the leptons, the latter in order to factorize out changes in lepton identification efficiency in the 
presence of pileup. We pick a fixed slice of parton-level instead of a fixed slice of m™ in order to 
disentangle biases introduced by pileup on the latter quantity. (Such bias is largely corrected away by 
filtering, but this is difficult to see simultaneously without a full two-dimensional plot.) While we do not 
show results at above 1 TeV, we indeed find that the effects of pileup become milder, as we did for 
the Z' samples. 
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FIG. 12: Distributions of the reconstructed hadronic W mass for 1 and 3 TeV Z' — > WW —> 
(lv)(q(f)- Displayed are particle-level (black), idealized particle-flow (blue), rescaled ECAL (green), 
and pure HCAL (red). Detector models are described in more detail in the text. 



Having demonstrated that the degrading effects of pileup can probably be mostly re- 
moved for TeV-scale Z' masses, we do not incorporate pileup or filtering into our boosted 
electroweak boson studies. However, we do emphasize that any realistic broad-spectrum Z' 
search should ideally utilize a pileup/UE subtraction method that works well in the sub- 
TeV mass range but applies smoothly up to trans- TeV masses without adversely affecting 
the reconstruction there. We also note that the alternative procedures of pruning [3] and 



trimming 



63] are also worth exploring, possibly in combination with filtering and with each 



other 



80|. 



Appendix B: Detector Effects 

While we do not have the tools to fully address the impact of the detector on the quality 
of boosted boson reconstruction, we can at least make plausibility arguments and highlight 
what issues might affect the jet-mass measurement. Probably the biggest worry is that for 
TeV-scale W-jets, the spatial resolution of the detector will be too poor to resolve the mass 
peak. To get a feeling for how bad the situation might be, we consider four toy models 
(inspired by the CMS detector) with decreasing spatial resolution: 
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1. Simple particle- level as a baseline. 

2. An idealization of particle-flow: charged particles are measured perfectly, photon en- 
ergy is deposited into perfect A77 x A0 = 0.02 x 0.02 ECAL cells, and neutral hadron 
energy is deposited into perfect 0.1 x 0.1 HCAL cells. The calorimeter cells then serve 
as massless particles (with momentum vector oriented along the cell's center in rj-(f)) 
for the purposes of jet clustering. 

3. An idealized ECAL+HCAL model where ECAL cells are used to trace the energy flow. 
Photons and non-isolated electrons are deposited into the ECAL, and all hadrons are 
deposited into the HCAL. (Muons and isolated electrons are kept as tracks.) Each 
HCAL cell has an associated 5x5 block of ECAL cells. The ECAL energy is rescaled 
to match the ECAL+HCAL energy, and the HCAL cells are discarded. The rescaled 
ECAL cells are used in jet clustering. 

4. An idealized pure HCAL, where all particles (except for muons and isolated electrons) 
are deposited in 0.1 x 0.1 cells. 

We apply these four models to the 1 and 3 TeV Z' — > WW — > (lv)(qq') signal samples, as 
shown in Fig. [Tj?J As one would expect, the impact of the detector model becomes greater 
at higher Z' mass. For 1 TeV Z', the typical A_R su bj C ts is about 0.35, and our detector 
segmentation is not a major issue. For 3 TeV Z', the typical Ai? su bj e t s is about 0.12, barely 
large enough for the two subjets to sit in individual HCAL cells. Nonetheless, even the pure 
HCAL detector manages to reconstruct the W peak, with about 15 GeV resolution. The 
simple trick of using the ECAL to trace the energy flow improves resolution by nearly a 
factor of 2, down to 8 GeV, and idealized particle flow incorporating the tracker could in 
principle reduce the smearing to the 5 GeV level. In what follows, we will take the rescaled 
ECAL detector as our working model, as a compromise. 

Of course, even this model, which neglects the possible utilization of the tracker, is likely 
optimistic. This is mainly for three reasons: the magnetic field will to some extent spread 
out the charged particles, calorimeter energy deposits will spread laterally as the showers 
develop inside the cells, and energy measurements in individual cells are subject to sampling 
errors and electronics noise. We address each of these in turn. 
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To coarsely model the effect of the magnetic field, we rotate the momentum vectors of 
charged particles in so that they are oriented toward the point at which they would impact 
the innermost part of the calorimeter on an arced trajectory. We take an inner calorimeter 
radius of 2 meters, and a magnetic field strength of 4T. Particles of pr < 1.2 GeV spiral-out 
and are lost. Using the rescaled ECAL model and 1 TeV Z', the reconstructed W mass is 
shifted upward by about 3 GeV and broadened by a comparable amount. As this is a small 
effect, and can probably be largely corrected off using particle flow techniques, 23 we continue 
to model the detector without a magnetic field in what follows. 

The energy deposited in the calorimeter from a given particle is not usually contained 
in a single cell, but will leak to some extent into neighboring cells. To get a sense for how 
sensitive we might be to this leakage, we rerun the 3 TeV Z' sample through the rescaled 
ECAL model with ECAL and HCAL cells both doubled in lateral size. The net effect is a 
roughly 50% increase in the m^> co smearing, to about 12 GeV. This is comparable to the 
pure 0.1 x 0.1 HCAL model, and is still very reasonable. 24 

Finally, we also consider the the effects of smearing out the energy measurements. The 
energy resolution of subjets has not been well-studied in full simulation, so as a rough 



guess we simply use the resolution curve for R = 0.5 cone jets in the CMS detector [8 If : 
AE/E = 5.6/ E © 1.25/y/E © 0.033, with © indicating quadrature sum and E measured 
in GeV. The effect on the rescaled ECAL results at 3 TeV is modest, whereas the relative 
effects at smaller Z' mass are more pronounced. The net result is that the W mass resolution 
comes out to about 10 GeV (12%) for all of our Z' samples. The spatial segmentation effects 
and energy-sampling effects roughly compensate for each other. 

Our nominal choice of detector model for the Z' search is the rescaled ECAL model 
with energy smearing applied as above. This roughly approximates some of the dominant 
resolution effects. We note that an ATLAS substructure-based study of a 1.1 TeV WW 



23 The largest degrading effect is from the softer particles, which separate out from the core of the jet. These 
are less susceptible to crowding of hits, which can frustrate precision tracking. 

24 A further concern is that a very energetic QCD background jet will appear artificially massive due to its 
energy spreading into several cells. For example, a very tight 1 TeV jet that shares its energy equally 
between two adjacent 0.1 x 0.1 HCAL cells would appear to have a mass of 50 GeV, even if its true mass 
is much smaller. Incorporating the energy pattern in the ECAL then becomes crucial. As long as this is 
done, we do not expect that spatial smearing effects will have such a significant impact on the background 
jet-mass distribution. 
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resonance in vector boson fusion [30] quotes a resolution for of 7.4 GeV, which is 

better than what we obtain with our model. However, a full simulation study at higher 
resonance mass is clearly warrented. 
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